Sequencing near infrared and infrared fluorescence labeled DNA for detecting using laser diodes and suitable labels therefor

ABSTRACT

To sequence DNA automatically, DNA marked with far infrared, near infrared, or infrared fluorescent dyes are electrophoresed in a plurality of channels through a gel electrophoresis slab or capillary tubes wherein the DNA samples are resolved in accordance with the size of DNA fragments in the gel electrophoresis slab or capillary tubes into fluorescently marked DNA bands. The separated samples are scanned photoelectrically with a laser diode and a sensor, wherein the laser scans with scanning light at a wavelength within the absorbance spectrum of said fluorescently marked DNA samples and light is sensed at the emission wavelength of the marked DNA.

REFERENCE TO RELATED APPLICATIONS

This application is a continuation-in-part of U.S. patent application Ser. No. 08/288,461, filed Aug. 10, 1994, now U.S. Pat. No. 5,534,125, which is a divisional of U.S. patent application Ser. No. 08/018,806, filed Feb. 17, 1993, now U.S. Pat. No. 5,360,523, which is a continuation-in-part of U.S. patent application Ser. No. 07/763,230, filed Sep. 20, 1991, now U.S. Pat. No. 5,230,781, which is a continuation-in-part of U.S. patent application Ser. No. 07/570,503, filed Aug. 21, 1990, now U.S. Pat. No. 5,207,880, which is a continuation-in-part of U.S. patent application Ser. No. 07/078,279, filed Jul. 27, 1987, now abandoned, which is a divisional of U.S. patent application Ser. No. 06/594,676, filed Mar. 29, 1984, now U.S. Pat. No. 4,729,947, and this application is also a continuation-in-part of U.S. patent application Ser. No. 08/204,627, filed Mar. 1, 1994, now U.S. Pat. No. 5,571,388, which is a continuation-in-part of U.S. patent application Ser. No. 07/860,140, filed Mar. 30, 1992, now U.S. Pat. No. 5,366,603, which is a divisional of U.S. patent application Ser. No. 07/763,230, filed Sep. 20, 1991, now U.S. Pat. No. 5,230,781, and this application is also a continuation-in-part of U.S. patent application Ser. No. 08/275,232, filed Jul. 14, 1994, now abandoned, which is a divisional of U.S. patent application Ser. No. 07/950,734, filed Sep. 24, 1992, now U.S. Pat. No. 5,346,603, which is a continuation of U.S. patent application Ser. No. 07/799,712, filed Nov. 26, 1991, now abandoned, which is a continuation of U.S. patent application Ser. No. 07/632,605, filed Dec. 24, 1990, now abandoned, which is a continuation of U.S. patent application Ser. No. 07/078,279, filed July 27, 1987, now abandoned.

BACKGROUND OF THE INVENTION

This invention relates to the sequencing of fluorescence labeled DNA and the detecting of the DNA after irradiation by light from a laser, and suitable labels therefor.

In one class of techniques for sequencing DNA, identical strands of DNA are marked with a fluorescent dye. The strands are marked by attaching specially synthesized fluorescent oligonucleotide primers or probes to the strands of DNA, or by attaching the fluorescent dye directly to the strands.

The strands are separated into four aliquots. The strands in a given aliquot are either individually cleaved at or synthesized to any base belonging to only one of the four base types, which are adenine, guanine, cytosine, and thymine (hereinafter A, G, C and T). The adenine, guanine, cytosine, and thymine terminated strands are then electrophoresed for separation and the separated strands are irradiated by a laser and the emission from the fluorescent dye detected. The rate of electrophoresis indicates the DNA sequence.

In a prior art sequencing technique of this class, the fluorescent dyes used as markers have their maximum emission spectra in the visible range, the DNA is subject to irradiation in the visible spectra, and visible spectra detectors and light sources are used. Generally photomultipliers tubes (PMT) are used for detection.

The prior art techniques for DNA sequencing have several disadvantages such as: (1) because the dyes have their emission spectra in the visible region of light spectrum, the lasers used to excite the fluorescent markers, and under some circumstances, the detectors for the light tend to be expensive; and (2) they are relatively noisy due to the background interferences by biomolecules.

Cyanine dyes are known to absorb far-red (600-700 nm) and near-infrared (700-1200 nm) light and techniques for the synthesis of derivatives of the cyanine dyes are known. However, obtaining chromophores with absorbance and emission bands that reduce the effect of background noise during gel electrophoresis apparatus when irradiating with a diode laser scanning has been difficult. Other dyes, not described herein, exhibit absorbance in wavelengths greater than 1200 nm (infrared) and would also provide discrimination against background noise.

Suitable types of cyanine dyes include heptamethine cyanine dyes. Cyanine dyes have traditionally been synthesized by a condensation reaction between a heterocyclic base containing an activated methyl group and an unsaturated bisaldehyde or its equivalent, usually as Schiff base in the presence of a catalyst. Sodium acetate has most frequently been used as a catalyst. In addition to ethanol, solvents such as acetic acid and/or acetic anhydride have also been commonly used as in the synthesis of heptamethine pyrylium dyes. This procedure suffers from several disadvantages, such as for example: (1) the purification of the product is very difficult because of the side products due to aniline; (2) the use of a catalyst interferes with the purity of the product and warrants repeated purification; (3) the reaction is generally faster and cannot be employed for the synthesis of nonsymmetric dyes in one pot; and (4) the scaling up of the reaction products to larger gram quantities leads to several additional problems resulting in poor quality and yield.

SUMMARY OF THE INVENTION

Accordingly, it is an object of the invention to provide a novel technique for DNA sequencing.

It is a further object of the invention to provide novel equipment and methods for the sequencing of far red, near infrared, and infrared fluorescent labeled DNA and the detection of the DNA after irradiation by light from laser diodes.

It is a still further object of the invention to provide novel dyes that fluoresce in the far red, near infrared, or infrared region.

It is a still further object of the invention to provide a new and novel method of synthesizing dyes having certain characteristics.

It is a still further object of the invention to provide novel dyes that fluoresce in the far red, near infrared, or infrared region in selected wavelengths when attached to oligonucleotides and that have sufficient quantum yield to make detection feasible.

It is a still further object of the invention to provide a novel probe or primer containing a dye that fluoresces in the far red, near infrared, or infrared region.

It is a still further object of the invention to provide a technique for continuous sequencing of DNA.

It is a still further object of the invention to provide a novel technique for continuously sequencing DNA using fluorescent detection.

It is a still further object of the invention to provide a novel technique for DNA sequencing using a novel fluorescent marker fastened to the DNA.

It is a still further object of the invention to provide a novel technique for continuously sequencing DNA marked with fluorescence which more clearly distinguishes marked DNA fragments from background fluorescence.

It is a still futher object of the invention to provide a novel fluorescent marker, method of synthesizing the marker and method of attaching the marker to DNA.

It is a further object of the invention to provide a novel method, marker, and equipment that permits the detection of marked molecules, including DNA molecules, using diode laser sources and diode detectors.

It is a further object of this invention to provide novel fluorescent dyes.

In accordance with the above and further objects of the invention, the sequencing of far red, near infrared, and infrared fluorescent labeled DNA and the detection of the DNA after irradiation by far red, near infrared, or infrared light from a laser diode is accomplished using a novel label. The strands of DNA are continuously electrophoresed and identified for any of several purposes, such as for example: (1) DNA sequencing; and (2) analysis of strands varying in length as prepared by such techniques as restriction enzyme cutting or polymerase chain reaction (PCR).

To aid in identification, the strands are marked with fluorescent labels that emit light in the far red, near infrared, or infrared region. The strands are irradiated with light in the far red, near infrared, or infrared region and the light emitted from the fluorescent labels is detected and used to obtain information about the DNA strands.

The marking is by direct labeling of fluorescent markers to the strands or by fluorescently labeled probes or primers hybridized to the separated strands. In one embodiment, the strands are detected by scanning with a far red, near infrared, or infrared laser diode light source.

The scanning apparatus includes a laser diode that emits light of a wavelength near or in the optimum absorption spectrum of the marker and a diode detector that detects the light near or in the optimum emission spectrum of the marker. In the preferred embodiment, the diode laser irradiates channels of DNA strands with near infrared light having a wavelength that matches the absorbance region of the marker and the detector includes an avalanche photodiode sensitive to the near infrared light emission of the marker. The scanner may include a filtering system having a pass band suitable for passing selectively the optimum emission of the fluorescent marker to the light sensor.

The photodiode, photomultiplier, or other light detector selectively detects the fluorescence, using techniques which enhance the signal/noise ratio. One technique is to modulate the laser source by pulsing the electrical current driving the laser, with detection through a lock-in amplifier. The detection is made in a wavelength band including the high emission spectrum of the fluorescent marker.

To mark the DNA strand, a novel dye having the desired properties is synthesized by a novel route or a known dye is modified. In the preferred embodiment, the novel dye has a preferred spectrum, has high absorption and fluorescence properties and has at least one reactive group enabling coupling to biomolecules such as DNA, proteins, and antibodies.

The dye is synthesized, modified from a known dye or selected to have an absorbance band and an emission band within a region encompassing the far red, near infrared, or infrared region when attached to a probe, primer, oligonucleotide or other molecule. The dye should provide high quantum yield in an optical band selected to reduce background noise. The preferred dyes for many applications calling for the labelling of biomolecules are cyanine dyes having an NCS group on the dye that will react with the amino group of the biomolecule to form a thiourea linkage or having carboxyl groups which can be converted to NHS esters that will react with the amino group of the biomolecule to form a stable amide linkage or hydroxyl groups for the purpose of reacting with a biomolecule to form stable carbamate linkages through NHS ester activation.

The preferred dyes are pentamethine or heptamethine cyanines which efficiently absorb light having wavelengths in the region of 630 to 900 nm. This wavelength is suitable for reducing background fluorescence in DNA sequencing and corresponds to the radiation wavelength of the diode lasers made of such materials as GaAlAs, GaAs, InGaAlP, GaInP, AlGaAs, AlGaInP, GaAlP, InGaAsP, GaInP/AlInP, InGaP/InGaAlP, or GaInP/AlGaInP. The GaAlAs diode, for example, emits light at wavelengths in the region of 780-800 nm and is used for scanning the gel electrophoresis sandwich used for DNA seqencing.

In one method for the synthesis of a suitable heptamethine cyanine dyes, a mixture of N-alkyl-substituted quaternary salts derived from 2,3,3-trimethylindole or 2,3,3-trimethylbenzindole and 2-chloro-1-formyl-3-(hydroxymethylene)cyclohex-1-ene is heated to ref lux in a mixture of 1-butanol and benzene (7:3) as solvent. No catalyst is used, and water is removed as an azeotrope. The resulting chloro compounds possess strong absorption and fluorescence properties in the near-infrared region and are converted to dyes bearing reactive functionalities such as hydroxy and isothiocyanate groups to attach the dye molecule nucleic acids. The dyes, thus modified are useful as fluorescent tags for nucleic acids and proteins. Several symmetric and nonsymmetric dyes have been synthesized in high yields.

In another embodiment, the far red, near infrared, or infrared dyes are selected to provide at least four dyes with each dye having its excitation and/or radiation spectra spaced sufficiently from each other dye so the flourescence from each dye can be distinguished from each other dye either by the wavelength that excites it into fluorescence or the wavelength at which it fluoresces or both, or by the fluorescence lifetime of the dye. The wavelength spacing is maintained sufficiently close to be excited by laser diodes. The dyes may be incorporated in probes and primers for attachment to oligonucleotides by such methods as described, for example, in Ruth, Jerry L. (1984) DNA 3, 123.

From the above summary, it can be understood that the techniques for the sequencing of fluorescence labeled DNA of this invention have several advantages, such as: (1) because the dyes have their emission spectra in the far red, near infrared, or infrared light spectrum, small inexpensive diode lasers may be used; and (2) this wavelength region is characterized by relatively low background fluorescence in glass, and, therefore, less noise in the received signal.

SUMMARY OF THE DRAWINGS

The above noted and other features of the invention will be better understood from the following detailed description when considered with reference to the accompanying drawings in which:

FIG. 1 is a perspective view of an embodiment of sequencer usable in the invention;

FIG. 2 is a sectional view taken through lines 2--2 of FIG. 1;

FIG. 3 is a sectional view of a portion of FIG. 1 taken through lines 3--3;

FIG. 4 is an exploded perspective view of a portion of the embodiment of FIG. 2;

FIG. 5 is an enlarged view, partly broken away, of a portion of the embodiment of FIG. 2;

FIG. 6 is a block diagram of a circuit that may be used for coordination of a sensor, scanner drive, and laser used; and

FIG. 7 is an embodiment of electrophoresis apparatus that may be used instead of the gel sandwich used in the embodiments of FIGS. 1-6.

DETAILED DESCRIPTION

The sequencing of far red, near infrared, or infrared fluorescence labeled DNA and the detection of the DNA after irradiation by far red, near infrared, or infrared light from a laser diode is accomplished using a far red, near infrared, or infrared label prepared for this purpose and either directly attached to the DNA or attached to probes or primers that will be attached to the DNA. In this specification the word "infrared" will be used at times to include far red wavelengths (600-700 nm), near infrared (700-1200 nm) and infrared (1200-4000 nm). The strands of DNA are continuously electrophoresed and identified for any of several purposes, such as for example: (1) DNA sequencing; and (2) analysis of strands varying in length as prepared by such techniques as restriction enzyme cutting or polymerase chain reaction (PCR).

The strands are marked with fluorescent labels that have their maximum fluorescense and their maximum absorbance at wavelengths of light in the far red, near infrared, or infrared region. The strands are irradiated with light in the far red, near infrared, or infrared region from a laser diode and the light emitted from the fluorescent labels is detected and used to obtain information about the DNA strands. The detector includes a light sensor which is preferably an avalanche photodiode sensitive to the infrared light emission of the marker. It may include a filtering system having a pass band suitable for passing selectively the optimum emission of the fluorescent marker to the light sensor.

To mark the DNA strand, a dye is prepared having the desired properties or a known dye is modified. In the preferred embodiment a novel dye having the preferred spectrum, high absorption and fluorescence properties, and at least one reactive group enabling coupling to biomolecules such as DNA, proteins, and antibodies is synthesized.

The dye is synthesized, modified from a known dye or selected to have an absorbance band and an emission band within a region encompassing the far red, near infrared, or infrared region when attached to a probe, primer, oligonucleotide or other molecule. The dye should provide high quantum yield in an optical band selected to reduce background noise. The preferred dyes for many applications calling for the labelling of biomolecules are cyanine dyes having an NCS group on the dye that may react with the amino goup of the biomolecule to form a thiourea linkage.

In the preferred embodiment, cyanine dyes are synthesized. The preferred dyes are heptamethine cyanines which efficiently absorb light having wavelengths in the region of 750 to 820 nm (nanometers) (maximum absorbance wavelength). This wavelength is suitable for reducing background fluorescence in DNA sequencing and corresponds to the radiation wavelength of the GaAlAs diode laser which is 780-800 nm. The GaAlAs diode is used for irradiating the gel electrophoresis sandwich, column, or capillary used for DNA seqencing. Formulas 1-13 are typical synthesized or proposed dyes and formula 14 is a suitable modified dye.

Formula 1 shows synthesized cyanine dyes having NCS (isothiocyanate) as a reactive group for attachment of a biomolecule. In this embodiment, when X is H, the maximum absorbance wavelength is 787 nm in methanol and 801 nm in DMSO, and the maximum emission wavelength is 807 nm in methanol and 818 nm in DMSO. When X is --OCH₃, the maximum absorbance wavelength is 786 nm in methanol and the maximum emission wavelength is 806 nm in methanol. In both cases, the quantum yield is greater than 15%.

Formula 2 shows synthesized cyanine dyes which have a high quantum yield in methanol around 35%. When R is ethyl or sulfonatobutyl and n is either 1 or 2, the maximum absorbance wavelength is between 762 and 778 nm. Depending on the solvent, the maximum emission wavelength is between 782 and 800 nm.

In the synthesized dye represented by formula 3, the maximum absorbance wavelength is between 773 and 778 nm, depending on the solvent and the maximum emission wavelength is between 789 and 806 nm, depending upon the solvent. The quantum yield is between 25% and 35%, depending upon the solvent.

In the molecule shown by formula 4, the dye contains a functional hydroxy group which either can be converted to NHS ester to permit coupling to DNA strands for the purposes of sequencing DNA or directly phosphitylated for use as a dye-labeled amidite for use in DNA synthesis.

In the synthesized dyes shown by formula 5, an increase in quantum yield is obtained by coupling the nitrogen groups on the two sides of the heterocyclic base by a polymethylene group including a number of methylene monomers chosen to maintain the distance between them. Generally, the methylene chain is between four and ten groups long, with nine being preferable. Similarly, ethers may be used with the same number of groups. The ethers are more stable. A functional group may be attached to provide convenient coupling to biomolecules.

A proposed series of molecules represented by formula 6 is expected to have increased solubility to permit its use in an aqueous medium through the use of an amino group or ether group attached to the benzene ring. In this series, X may be oxygen or NH, Y may be NCS (isothiocyanate) or H, and R may be ##STR1## H, NCS, CH₂ OH, CH₂ NCS or COOH according to tables 1 or 2.

In formula 7, representing synthesized dyes: R₁ may be ethyl, 4-sulfonatobutyl, 3-aminopropyl, phthalimidobutyl, hexyl and pentyl carboxylates; Z may be S, O, CMe₂, R₂ may be H, SO₃ Na, SO₃ ⁻ Et₃ ⁺ NH, OCH₃, NO₂, CO₂ Na, or CO₂ -Et₃ +NH and X may be H, N(CH₂ CO₂ CH₃)₂, CH₂ CH₂ OH, NCS, CH₂ CH₂ CH₂ OH, or --(CH₂)n--OH where the number n is any number between and including 8-12.

One series of synthesized molecules is that set of formula 7 where R₁ is 4-sulfonatobutyl and Z is CMe₂. In this series of synthesized molecules:

(1) When X and R₂ are hydrogen, the absorbance wavelength is 767 nanometers, the emission wavelength is 787 nanometers and a high quantum yield of 29% in methanol is obtained.

(2) When X is equal to MeO and R₂ is hydrogen, the absorbance wavelength is about 768 in both water and methanol nanometers and the emission wavelength is 789 nm. The quantum yield in methanol is 35%.

(3) When X is a nitro group and R₂ is hydrogen, the absorbance wavelength is between 770 and 774 nanometers in water and methanol and the emission wavelength is between 792 and 798 nm wavelengths in water and methanol, with a high quantum yield of 27% in methanol.

(4) When X is NCS and R₂ is hydrogen (shown in formula 8), the absorbance wavelength is about 769 nanometers in both water and methanol and the emission wavelength is 788 nm in water and methanol with a high quantum yield of 38% in methanol.

(5) When X is NCS and R₂ is OMe, the absorbance maximum wavelength is between 790 and 796 nm in water and methanol and the maximum emission wavelength is at 814 nm in water and methanol. The quantum yield is low.

The series of molecules represented by formula 9 have been synthesized and some are effective as IR labels when R₁, R₂ and X are as shown in table 3.

The series of molecules represented by formula 10 have been synthesized and are effective as IR labels when R₁ and X are as shown in table 4.

The series of molecules represented by formula 11 have been synthesized and are effective as IR labels when R₁, R₂ and X are as shown in table 5.

The series of molecules represented by formula 12 have been synthesized and are effective as IR labels when R₁, R₃ and R₂ are as shown in table 6.

The series of molecules represented by formula 13 have been synthesized and are effective as IR labels when R and X are as shown in table 7.

In another embodiment, the infrared dyes are selected to provide at least two dyes with each dye having its exitation and/or radiation spectra spaced sufficiently from each other dye so the fluorescence from each dye can be distinguished from each other dye either by the wavelength that excites it into fluorescence, or the wavelength at which it fluoresces or both, or by the fluorescence lifetime of the dye. The wavelength spacing is maintained sufficiently close to be excited by laser diodes. The dyes may be incorporated in probes and primers for attachment to oligonucleotides by such methods as described, for example, in Ruth, Jerry L. (1984) DNA 3, 123. The dyes may be newly synthesized or prepared by modifying existing, commercially available dyes.

There are many dyes suitable for such modification such as for example: (1) 3,3'-Diethylthiadicarbocyanine Iodide; (2) 3,3'-Diethylthiatricarbocyanine Perchlorate; (3) 3,3' Diethyloxatricarbocyanine Iodide; (4) 1,1',3,3,3'-Hexamethylindotricarbocyanine Perchlorate; ##STR2##

                  TABLE 1                                                          ______________________________________                                         Dye formula 6 series, when X = O (oxygen)                                        X               Y            R                                               ______________________________________                                         O             NCS          H                                                     O H NCS                                                                        O H CH.sub.2 OH                                                                O H CH.sub.2 NCS                                                               O H COOH                                                                     ______________________________________                                    

                  TABLE 2                                                          ______________________________________                                         Dye formula 6 series, when X = NH                                                X               Y            R                                               ______________________________________                                         NH            NCS          H                                                     NH H NCS                                                                       NH H CH.sub.2 OH                                                               NH H CH.sub.2 NCS                                                              NH H COOH                                                                    ______________________________________                                          ##STR3##

                  TABLE 3                                                          ______________________________________                                         R.sub.1          R.sub.2      X                                                ______________________________________                                         --C.sub.4 H.sub.8 SO.sub.3.sup.- (Na.sup.+)                                                     H            NCS                                                --C.sub.4 H.sub.8 SO.sub.3.sup.- (Na.sup.+) H OCH.sub.3                        --C.sub.4 H.sub.8 SO.sub.3.sup.- (Na.sup.+) H NO.sub.2                         --C.sub.4 H.sub.8 SO.sub.3.sup.- (Na.sup.+) H H                                --C.sub.2 H.sub.5 (I.sup.-) H OCH.sub.3                                        --C.sub.2 H.sub.5 (I.sup.-) OCH.sub.3 OCH.sub.3                                --C.sub.4 H.sub.8 SO.sub.3.sup.- (Na.sup.+) OCH.sub.3 OCH.sub.3                                             --C.sub.4 H.sub.8 SO.sub.3.sup.- (Na.sup.+)                                    OCH.sub.3 NCS                                     --C.sub.3 H.sub.6 NCS(Br.sup.-) H H                                            C.sub.2 H.sub.5 (I) H NCS                                                      C.sub.4 H.sub.8 SO.sub.3.sup.- (Na.sup.+) SO.sub.3.sup.- Na.sup.+ NCS                                       C.sub.2 H.sub.5 (I.sup.-) SO.sub.3.sup.-                                      Na.sup.+ NCS                                     ______________________________________                                    

                  TABLE 4                                                          ______________________________________                                         R.sub.1               X                                                        ______________________________________                                         --C.sub.2 H.sub.5 (I.sup.-)                                                                          OPhNCS                                                     --C.sub.4 H.sub.8 SO.sub.3.sup.- (Na.sup.+) OPhNCS                             --C.sub.3 H.sub.6 NCS(Br.sup.-) OPh                                            phthalimido butyl OPhNCS                                                     ______________________________________                                    

                  TABLE 5                                                          ______________________________________                                         R.sub.1        R.sub.2      X                                                  ______________________________________                                         --C.sub.3 H.sub.6 NCS(Br.sup.-)                                                               --C.sub.4 H.sub.8 SO.sub.3 H                                                                Cl                                                   --C.sub.3 H.sub.6 NCS(Br.sup.-) --C.sub.2 H.sub.5 Cl                           --C.sub.3 H.sub.6 NCS --C.sub.4 H.sub.8 SO.sub.3.sup.- OPh                     --C.sub.3 H.sub.6 NCS(Br.sup.-) --C.sub.2 H.sub.5 OPh                          C.sub.3 H.sub.6 NCS C.sub.4 H.sub.8 SO.sub.3.sup.- OPh(pMeO)                 ______________________________________                                    

                  TABLE 6                                                          ______________________________________                                         R.sub.1         R.sub.3     R.sub.2                                            ______________________________________                                         C.sub.2 H.sub.5 (I.sup.-)                                                                      H           H                                                    (CH.sub.2).sub.4 SO.sub.3.sup.- (Na.sup.+) H H                                 C.sub.2 H.sub.5 (I.sup.-) COOCH.sub.3 H                                        C.sub.2 H.sub.5 (I.sup.-) COOCH.sub.3 --(CH).sub.4 --                          (CH.sub.2).sub.4 SO.sub.3.sup.- (Na.sup.+) COOCH.sub.3 --(CH).sub.4          ______________________________________                                                                     --                                                  ##STR4##

                  TABLE 7                                                          ______________________________________                                                R         X                                                             ______________________________________                                                C.sub.2 H.sub.5 (I.sup.-)                                                                O                                                               (CH.sub.2).sub.4 SO.sub.3.sup.- (Na.sup.+) O                                   C.sub.2 H.sub.5 (I.sup.-) S                                                    (CH.sub.2).sub.4 SO.sub.3.sup.- (Na.sup.+) S                                 ______________________________________                                          (5) 1,1'-Diethyl-2,2'-dicarbocyanine Iodide; (6)      3,3'-Diethylthiadicarbocyanine Iodide; (7) 3,3'-Diethyloxatricarbocyanine      Iodide; (8) 1,1',3,3,3',3'-Hexamethylindotricarbocyanine Perchlorate; (9)      1,1',3,3,3',3'-Hexamethylindotricarbocyanine Iodide; and (10) Indocyanine      Green.

In one embodiment, the dye has the chemical structure shown in formula 14, with R being --CH₂ CH₃. This dye is close to having the desired wavelength of maximum fluorescence and the wavelength of maximum absorbance may be modified by changing the functional group R. The unmodified dye may be obtained from Laboratory and Research Products Division, Eastman Kodak Company, Rochester, N.Y. 14650. It is advertised in the Kodak laser dyes, Kodak publication JJ-169.

The modifications with 1,3-propanediol can be made in a manner known in the art as illustrated by equation 1. For example, changes occur when different esters are formed replacing the ethyl alcohol in the original dye molecule (R equal --CH₂ --CH₃ of formula 14). If different glycol esters are formed, absorbance maxima of these new near infrared dyes shift to the longer wavelengths.

The absorbance maximum is dependent on the distance of the oxygen atoms in the glycol functional group. However, the fluorescence maxima of these new near infrared dyes are practically at same wavelength of the dye of formula 14, i.e. 819 nm. This indicates that only the excitation process has changed, i.e. only the energy level to which the transition occurs during absorption has changed. The lowest vibronic level of first excited state remains unchanged. The absorbance maxima of such esters derived from several glycols are: (1) ethylene glycol 796 nm (nanometers); (2) 1,3-Propanediol 780 nm; (3) 1,4-Butanediol 754 nm; (4) 1,6-Hexanediol 745 nm; (5) Triethylene glycol 790 nm; (6) ethylene thiol 810 nm; and (7) IR-144 (R=CH₂ CH₃) 742 nm.

In the preferred embodiment, the fluorescence maximum wavelength is about 819 nanometers and the detector is adjusted to receive this wavelength and not others by appropriate filtering. The absorbance maxima is selected to be different and to correspond to the preferred available laser diode emission. For example, in formula 14, R may be any of the following seven groups, depending on the desired wavelength of the emitted light, which are:

(1) --CH₂ CH₂ --OH for an absorbance wavelength of 796 nanometers;

(2) --CH₂ CH₂ CH₂ --OH for an absorbance wavelength of 780 nanometers;

(3) --CH₂ CH₂ CH₂ CH₂ OH for an absorbance wavelength of 754 nanometers;

(4) --CH₂ CH₂ CH₂ CH₂ CH₂ CH₂ --OH for an absorbance wavelength of 745 nanometers;

(5) --CH₂ CH₂ --O--CH₂ CH₂ --O--CH₂ CH₂ --OH for an absorbance wavelength of 790 nanometers;

(6) --CH₂ CH₂ --SH for an absorbance wavelength of 810 nanometers; and

(7) --CH₂ CH₃ for an absorbance wavelength of 742 nanometers.

More specifically, in the uncatalyzed synthesis of new heptamethine cyanine dyes which are potential precursors for making functionalized near-infrared labels, a mixture of a quaternary salt of a heterocyclic base containing an activated methyl group (2 equiv) and 2-chloro-1-formyl-3-(hydroxymethylene)cyclohex-1-ene (2), an unsaturated bisaldehyde derived from cyclohex-anone (1 equiv) is ##STR5## heated to ref lux in a mixture of 1-butanol and benzene (7:3) as solvent, without using any catalyst. The water formed during the reaction is removed as an azeotrope by a Dean-Stark condenser. The reactions generally require 3-12 h for completion. The resulting product is generally pure after simple filtration of the dye from the crude reaction mixture followed by washings with diethyl ether. A wide range of 2-methy-1-alkyl quaternary salts of various indole and benzoindole derivatives undergo this reaction in a facile manner to form the corresponding symmetric dyes. An important feature of the current method is that the slower rate of the reaction allows one to prepare nonsymmetric dyes derived from two different heterocycles in a single pot in a fairly good yied. The syntheses of these nonsymmetric dyes become important when changes in the spectral and physical properties of the dyes are desired for specific application and compatibility with instrumentation. These changes can be incorporated by appropriately modifying the structural design of the dyes.

Dye synthesis is illustrated by chart 1, which shows the synthesis of formulae 2, 3, 7, 8, and 9. In the chloro intermediate in chart 1, R₁ and R₂ may have the values shown in table 9. In chart 1, X₁ is a halogen. Chart 2 shows the synthesis of formula 10. Chart 3 shows the synthesis of formulae 1, 4 and 11. Chart 4 shows the synthesis of formula 12. Chart 5 shows the synthesis of formula 13. In chart 5, X is oxygen or sulfur.

In FIG. 1, there is shown a perspective view of an embodiment of sequencer in which the method of this invention may be performed. This sequencer is described in structure and operation in the aforementioned U.S. patent application Ser. No. 07/570,503 filed Aug. 21, 1990; U.S. patent application Ser. No. 07/078,279 filed Jul. 27, 1987; and U.S. Pat. No. 4,729,947, all of which are entitled DNA SEQUENCING and which were filed by Middendorf et al.

In FIG. 2, there is shown a sectional view of the remote station 122A taken through section lines 2--2 of FIG. 1 having an electrophoresis section 140, a scanning section 142, an electrophoresis power supply 144, a system power supply section 144A, an analog board 146 and a digital board 148. The electrophoresis section 140 is positioned near the front of the cabinet and a portion of it is adapted

                  TABLE 9                                                          ______________________________________                                                R.sub.1       R.sub.2                                                   ______________________________________                                                C.sub.4 H.sub.8 SO.sub.3 H                                                                   H                                                           C.sub.2 H.sub.5 H                                                              C.sub.3 H.sub.6 NH.sub.3 Br.sup.- H                                            C.sub.4 H.sub.8 SO.sub.3 H SO.sub.3.sup.- Na.sup.+                             C.sub.2 H.sub.5 SO.sub.3.sup.- Na.sup.+                                        C.sub.4 H.sub.8 SO.sub.3 H OCH.sub.3                                           C.sub.2 H.sub.5 OCH.sub.3                                                    ______________________________________                                          ##STR6##      to be scanned by the scanning section 142 in cooperation with circuitry on      the analog board 146 and the digital board 148. All of the apparatus are      electrically connected to the power supply section 144A for such      operation.

To separate different DNA fragments into bands, the electrophoresis section 140 includes a gel sandwich 150, an upper buffer assembly 152, support assembly 154, and a lower buffer assembly 151 positioned to enclose the bottom of the gel sandwich 150. In the embodiment of FIG. 2, the gel sandwich 150 is held vertically and its temperature is controlled during operation. Bands are separated by applying voltage to the upper buffer assembly 152 and lower buffer assembly 151 and scanned by the scanning section 142.

To support the gel sandwich 150, the support assembly 154 includes a pair of upper side brackets and lower side brackets 160 and 162 (only one of each pair being shown in FIG. 2), an apparatus support plate 168, a temperature control heating plate 164, and a plastic spacer, shown at 166A-166C, in FIG. 2. The entire structure is supported on the apparatus support plate 168 which mounts the upper and lower side brackets 160 and 162.

The upper and lower side brackets 160 and 162 are each shaped to receive a pin (not shown) extending from one or the other side of the gel sandwich 150 and hold it in place in juxtaposition with the scanning section 142. The spacers as shown as 166A-166C space the temperature control heating plate 164 from the apparatus support plate 168 and maintain it at a constant selected temperature above ambient temperature. In the preferred embodiment, the temperature is maintained at 45-50 degrees Centigrade and should be maintained in a range of 30 degrees to 80 degrees.

The scanning section 142 includes a laser diode assembly (not shown in FIG. 2), a microscope assembly 172, a photodiode section 174 and a scanner mounting section 176. The laser diode assembly (not shown in FIG. 2) is positioned at an angle to an opening in the heating plate 164 so that light impinges on the gel sandwich 150 to cause fluorescence with minimum reflection back through the microscope assembly 172.

To receive the fluorescent light, the microscope assembly 172 is focused on the gel sandwich 150 and transmits fluorescent light emitted therefrom into the photodiode section 174 which converts it to electrical signals for transmission to and processing by the analog and digital boards 146 and 148 which may provide further analysis of data. The scanning section 142 moves along a slot in the apparatus support plate 168 which is mounted to the scanner mounting section 176 during this .pa operation in order to scan across the columns in the gel sandwich 150.

The scanner mounting section 176 includes a mounting plate 180, a bearing plate 182, a stepping motor 184, a slidable support 186 and a belt and pully arrangement 185, 188A and 188B. The mounting plate 180 is movably mounted to the apparatus support plate 168 through a frame member and supports the elongated bearing plate 182, the stepping motor 184 and two pulleys 188A and 188B. The elongated bearing plate 182 extends the length of the gel sandwich 150.

To permit motion of the laser diode assembly (not shown) and microscope assembly 172 with respect to the gel sandwich 150, the slidable support 186 supports the microscope assembly 172 and diode assembly and slidably rests upon the bearing plate 182. An output shaft 183 of the stepping motor 184 drives a pulley 188B through pulley 188, belt 185, and pulley 188A and the pulley 188B drives a belt (not shown) that is clamped to the slidable support 186 to move it the length of the gel sandwich 150 during scanning by the laser diode and microscope assembly 172 which rest upon it. The stepping motor 184 under the control of circuitry in the digital board 148 moves the pulley 188B to move the belt (not shown) and thus cause scanning across the gel sandwich 150.

As shown in this view, the electrophoresis power supply 144 is electrically connected to buffer in the upper buffer assembly 152 through an electrical connector 194 and to the lower buffer assembly 151 through a connector not shown in FIG. 2.

The upper buffer assembly 152 includes walls 197 forming a container to hold a buffer solution 195 and a cover 199 formed with a lip to fit over the walls 197 from the top and containing a downwardly extending flat member spaced away from the side walls and holding a conductor 211. The conductor 211 is electrically connected to the source of power through connector 194 which is mounted to the top of the cover 199 to permit electrical energization of the buffer solution 195.

The bottom buffer assembly 151 includes enclosed walls 201 defining a container for holding a buffer solution 203 and a cover 205 closing the container 201 and having a downwardly extending portion 213 extending into the buffer 203 for supporting a conductor 207 for applying energy to the bottom buffer solution 203. The gel sandwich 150 extends downwardly into the buffer solution 203 and upwardly into the buffer solution 195 to permit the electrical contact for electrophoresis. An "o" ring 197B provides a seal for the upper buffer assembly 152 so that the buffer solution 195 does not empty out of the upper buffer assembly 152.

In FIG. 3, there is shown a sectional view taken through lines 3--3 of FIG. 1 showing a portion of the electrophoresis section 140, a portion of the scanning section 142 (indicated twice in FIG. 3 for clarity) and the electrophoresis power supply section 144 (FIG. 2 only) mounted together to illustrate, from a top view, the arrangement of the apparatus support plate 168, the heater plate 164, the gel sandwich 150, a laser diode assembly 170, a microscope assembly 172 and a photodiode assembly 174. The heater plate 164 and apparatus support plate 168 have slots running in a horizontal direction orthogonal to the lanes of DNA in the electrophoresis section 140 sized to receive the ends of the laser diode assembly 170 and the microscope section 172 for scanning thereof.

To cooperate with the separation and scanning of DNA bands, the gel sandwich 150 includes a front glass plate 200, a gel section 202 and a rear glass plate 204 mounted in contact with the heater plate 164 and having a section exposed for scanning by the laser diode assembly 170 and the microscope assembly 172. The rear glass plate 204 contacts the heater plate 164 and is separated from the front glass plate 200 by the gel section 202 within which DNA separation takes place. The front and rear glass plates 200 and 204 may be any type of glass but are preferably soda lime which has low fluorescence in the far red and near infrared regions and is prepared by a process that provides optically flat surfaces without grinding.

To transmit light to the gel sandwich 150, the laser diode assembly 170 includes a housing 210, a focusing lens 212, a narrow band pass filter 214, a collimating lens 216 and a laser diode 218. The laser diode 218 emits far red, near infrared, or infrared light which is collimated by the laser collimating lens 216 and filtered through the narrow band pass filter 214. This light is focused by the focusing lens 212 onto the gel sandwich 150. Preferably, the point of focus on the gel section 202 of the gel sandwich 150 lies along or near the central longitudinal axis of the microscope section 172 and the photodiode section 174.

The thickness of the glass plates and the gel, the position of the laser and microscope assembly 172 and thus the angle of incidence and angle of reflection of the light from the laser and to the microscope assembly 172 are chosen, taking into consideration the refractive index of the gel and glass and the thickness of the glass plates and the gel, so that the light from the laser is maximally transmitted to the gel. The light from the laser is not directly reflected back because the angle of incidence to normal is equal to the Brewster's angle at the first interface and is such as to impinge on the markers with full intensity after refraction but not be reflected by the first surface of the gel sandwich 150 into the microscope assembly 172 and the microscope assembly 172 views those markers that fluoresce in its line of sight.

To maintain temperature control over the laser diode, the housing 210: (a) is coupled to a heat sink through a thermal electric cooler 220, and (b) encloses the focusing lens 212, narrow band pass filter 214, collimating lens 216 and laser diode pa 218; and (c) accommodates the electrical leads for the diode.

To receive and focus light emitted by fluorescent markers from the gel section 202 in response to the light from the laser diode assembly 170, the microscope assembly 172 includes a collection lens 230, a housing 232 and a focusing motor. The microscope assembly 172 is adapted to be positioned with its longitudinal axis centered on the collection lens 230 and aligned with the photodiode section 174 to which it is connected. For this purpose, the housing 232 includes a central passageway in which are located one or more optical filters (not shown) with a pass band matching the emission fluorescence of the marked DNA strands. With this arrangement, the collection lens 230 receives light from the fluorescent material within the gel section 202 and collimates the collected light for optical filtering and then transmission to the photodiode assembly 174.

To generate electrical signals representing the detected fluorescence, the photodiode assembly 174 includes a housing 240 having within it, as the principal elements of the light sensors, an inlet window 242, a focusing lens 244, a sapphire window 246 and an avalanche photodiode 248. To support the avalanche photodiode 248, a detector mounting plate 250 is mounted within the housing 240 to support a plate upon which the avalanche photodiode 248 is mounted. The inlet window 242 fits within the housing 240 to receive light along the longitudinal axis of the photodiode assembly 174 from the microscope assembly 172.

Within the housing 240 of the photodiode assembly 174, the sapphire window 246 and avalanche photodiode 248 are aligned along the common axis of the microscope assembly 172 and the photodiode assembly 174. The focusing lens 244 focuses light transmitted by the microscope assembly 172 onto a small spot on the avalanche photodiode 248 for conversion to electrical signals. A thermoelectric cooler 252 utilizing the Peltier effect is mounted adjacent to the detector mounting plate 250 to maintain a relatively cool temperature suitable for proper operation of the avalanche photodiode 248.

As best shown in this view, the stepping motor 184 rotates the belt 185 to turn the pulley 188A, which, in turn, rotates pulley 188B. The pulley 188B includes a belt 177 extending between it and an idler pulley 179 and attached at one location to the slideable support 186 (shown only in FIG. 2) to move the scanning microscope and laser lengthwise along the gel sandwich 150 for scanning purposes. The motor 184, by moving the carriage back and forth, accomplishes scanning of the gel sandwich 150.

In FIG. 4, there is shown a fragmentary perspective view of the gel sandwich 150 and the upper buffer assembly 152 mounted to each other showing the outer glass plate 200 cut away from the rear glass plate 204 to expose the gel section 202 to buffer solution within the upper buffer assembly 152. With this arrangement, DNA samples may be pipetted between the glass plates 200 and 204 and moved downwardly by electrophoresis beyond the upper buffer assembly 152 and through the gel sandwich 150 to the bottom buffer (not shown in FIG. 4).

In FIG. 5, there is shown a broken away view of the gel sandwich 150 illustrating the upper buffer assembly 152 and the lower buffer assembly 151 connected to it at each end. As shown in this view, the cover 199 includes a connecting post 214 which receives the conductor 211 for connection to the downwardly extending portion of the cover 199 into the buffer compartment. Between the glass plates 200 and 204 (FIG. 4) of the gel sandwich 150, are a plurality of downwardly extending recesses 231 in the gel section 202 (FIG. 4) between the plates. DNA sample is pipetted into these recesses to form channels for electrophoresing to the lower buffer assembly 151.

To form an electrical connection through the gel sandwich 150 from the upper buffer assembly 152 to the lower buffer assembly 151, a connecting post 218 is connected to the cover 205 of the lower buffer assembly 151 for receiving the conductor 207 which extends downwardly to the downwardly extended plate 213 and into the buffer solution.

In FIG. 6, there is shown a block diagram of the circuitry used to control the remote station 122A of the embodiment of FIG. 2 having a control, correlation and readout section 250, the scanner drive 176, the motor assembly 184 for moving the scanner drive 176, and the sensing configuration 252. The sensing configuration 252 includes the laser assembly 170 and the sensor assembly 174 which receives signals, removes some noise, and transmits the signals for display and readout in the control, correlation and readout section 250, while the scanner drive 176 and motor for the scanner drive 184 receive signals from the control, correlation and readout section 250 to control the motion of the sensor back and forth across the gel sandwich. This overall configuration is not part of the invention of this application except insofar as it cooperates with the sensing configuration 252 to scan the DNA and determine its sequence in accordance with the embodiments of FIGS. 1-5.

To drive the sensor 174 from position to position, the motor assembly 184 includes a stepper motor 254 and a motor driver 256. The motor driver 256 receives signals from the control correlation and readout section 250 and actuates the stepper motor 254 to drive the scanner drive 176. The scanner drive 176 is mechanically coupled to a stepping motor 254 through a belt and pulley arrangement for movement back and forth to sense the electrophoresis channels on the gel sandwich 150 (FIG. 3). The stepping motor 254 and driver circuitry 256 are conventional and not themselves part of the invention.

The control, correlation and readout system 250 includes a computer which may be any standard microprocessor 260, a television display or cathode ray tube display 262 and a printer 264 for displaying and printing the results of the scans.

To sense data, the sensing configuration 252 includes, in addition to the laser 170 and the sensor 174, a chopper circuit 270, a sensor power supply 272, a preamplifier 274, a lock-in amplifier 276, a 6-pole filter 278, a 12-bit analogue to digital converter interface circuit 280 and a laser power supply 282.

The sensor 174 receives light from the laser 170 after it impinges upon the gel sandwich 150 (FIG. 3) and transmits the signals through preamplifier 274 to the lock-in amplifier 276. The sensor receives signals from the sensor power supply 272. The chopper circuit 270 provides pulses at synchronized frequencies to the lock-in amplifier 276.

The laser 170 receives power from the power supply 282 which is controlled by the chopper circuit 270 so that the signal from the laser is in synchronism with the signal applied to the lock-in amplifier 276 so that the output from the lock-in amplifier 276 to the 6-pole filter 278 discriminates against unwanted signal frequencies. This signal is converted to a digital signal in the 12-bit analogue to digital converter 280 which serves as an interface to the computer 260.

With this arrangement, the scanning rate may be set to discriminate against noise and the synchronized demodulation from the chopper control further reduces noise, particularly discriminating against the natural fluorescense of the glass in the gel sandwich 150 (FIGS. 2 and 3).

In FIG. 7, there is shown another embodiment C26A which may be substituted for the gel sandwich 150 and includes capillary columns 61, 63 and 65 as commonly used in capillary electrophoresis. These columns may be filled with buffer solution or a gel and be used for electrophoresis. In such a case, several capillaries may be used as a substitute for the gel sandwich 150 of the embodiment of FIGS. 2-5. Thus, the same band of A, G, C or T type bases might flow through several parallel bundles of capillaries, or they might flow through only one capillary per type of base; or bands of A, G, C, and T type bases may be combined to flow through only one capillary.

The separation path such as gel channels or capillary tube length should be no longer than two meters for a range of lengths of DNA from 50 to 10,000 or more bases. However, as the range of DNA lengths increase, the time required increases. Also, the time required for each separation is in the range of from 1/2 second to 5 minutes for each added base of length separation.

From the above summary, it can be understood that the techniques for the sequencing of fluorescence labeled DNA of this invention have several advantages, such as: (1) because the dyes have their emission spectra in the far red, near infrared or infrared light spectrum, small inexpensive diode lasers may be used; and (2) this wavelength region is characterized by relatively low background fluorescence in glass, and, therefore, less noise in the received signal.

Although a preferred embodiment of the invention has been described with some particularity, many modifications and variations are possible in the preferred embodiment within the light of the above description. Accordingly, within the scope of the appended claims, the invention may be practiced other than as specifically described. 

What is claimed is:
 1. A method of identifying strands of DNA comprising the steps of:marking the strands with fluorescent labels that emit light in a region of wavelengths including at least one wavelength within one of a far red, near infrared, and infrared regions wherein the fluorescent label includes a chromophore having the formula: ##STR7## Where X is H or OCH₃ ; irradiating the strands with light having the wavelength within one of the far red, near infrared, and infrared regions; and detecting the light emitted from the fluorescent labels.
 2. A method of identifying strands of DNA in accordance with claim 1 in which the step of detecting includes the step of detecting one of far red, near infrared, and infrared light emissions from the fluorescent labels.
 3. A method of identifying strands of DNA comprising the steps of:marking the strands with fluorescent labels that emit light in a region of wavelengths including at least one wavelength within one of a far red, near infrared, and infrared regions wherein the fluorescent label includes a chromophore having the formula: ##STR8## where R is ethyl or 4-sulfonatobutyl and n is either 1 or 2; irradiating the strands with light having a wavelength within one of the far red, near infrared, and infrared regions; and detecting the light emitted from the fluorescent labels.
 4. A method of identifying strands of DNA in accordance with claim 3 in which the step of detecting includes the step of detecting one of far red, near infrared, and infrared light emissions from the fluorescent labels.
 5. A method of identifying strands of DNA comprising the steps of:marking the strands with fluorescent labels that emit light in a region of wavelengths including at least one wavelength within one of a far red, near infrared, and infrared regions wherein the fluorescent label includes a chromophore having the formula: ##STR9## irradiating the strands with light having the wavelength within one of the far red, near infrared, and infrared regions; and detecting the light emitted from the fluorescent labels.
 6. A method of identifying strands of DNA in accordance with claim 5 in which the step of detecting includes the step of detecting one of far red, near infrared and infrared light emissions from the fluorescent labels.
 7. A method of identifying strands of DNA comprising the steps of:marking the strands with fluorescent labels that emit light in a region of wavelengths including at least one wavelength within one of a far red, near infrared, and infrared region wherein the fluorescent label includes a chromophore having the formula: ##STR10## irradiating the strands with light having the wavelength within one of the far red, near infrared, and infrared regions; and detecting the light emitted from the fluorescent labels.
 8. A method of identifying strands of DNA in accordance with claim 7 in which the step of detecting includes the step of detecting one of far red, near infrared, and infrared light emissions from the fluorescent labels.
 9. A method of identifying strands of DNA comprising the steps of:marking the strands with fluorescent labels that emit light in a region of wavelengths including at least one wavelength within one of a far red, near infrared, and infrared regions wherein the fluorescent label includes a chromophore having the formula: ##STR11## applying the DNA strands marked with fluorescent labels at a plurality of locations in a gel electrophoresis slab or set of capillaries for electrophoresing in a plurality of channels through the gel electrophoresis slab or set of capillaries wherein the fluorescent labels include a fluorescent dye having absorbance and fluorescense maxima at one of the far red, near infrared and infrared wavelengths when combined with DNA; establishing electrical potential across said gel electrophoresis slab or set of capillaries wherein the marked DNA strands are resolved into bands of fluorescently marked DNA strands in said gel electrophoresis slab or set of capillaries in accordance with the size of the fluorescently marked DNA strands; and irradiating the bands of fluorescently marked DNA strands with light having the wavelength within one of the far red, near infrared, and infrared regions while they are in the slab or set of capillaries with a laser diode wherein the laser diode scans with far red, near infrared, or infrared scanning light at a wavelength within an absorbance spectrum of said fluorescently marked DNA strands; and detecting the light emitted from the fluorescent labels with a sensor wherein the sensor senses light at the emission wavelength of the marked DNA. 